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INTRODUCTION 

Background 

The  expression  of  estrogen  receptor  (ER)  is  intimately  associated  with  the  biology  of 
breast  carcinoma.  Breast  carcinomas  occurring  in  postmenopausal  women  are  often  ER- 
positive  ^  and  many  of  these  tumors  express  significantly  more  receptor  than  normal 
mammary  epithelium  ER-negative  breast  carcinomas  are  more  likely  to  occur  in  young 
women  and  these  tumors  carry  a  worse  prognosis  than  carcinomas  which  express  ER 
Several  studies  have  focused  on  the  function  of  ER  in  an  attempt  to  explain  the  association 
between  ER  expression  and  tumor  biology.  Mutations  have  been  described  in  the  ER  gene 
of  some  breast  carcinomas  that  render  these  altered  ER  proteins  incapable  of  binding 
estrogen  response  elements  (ERE)  ^  and  able  to  inhibit  wild-type  ER  function 
Other  studies,  however,  have  found  ER  mutations  which  result  in  a  constitutively  active 
receptor  which  has  also  been  postulated  as  important  to  the  development  of  hormone- 
independent  growth  If  ER  function  is  influencing  the  oncogenic  process,  it  is 

difficult  to  conceptualize  within  a  single  model  of  oncogenesis  the  occurrence  of  mutations 
which  inhibit  ER  function  and  mutations  which  result  in  constitutive  activity.  An 
alternative  hypothesis  is  that  mechanisms  regulating  transcription  of  the  ER  gene  influence 
the  phenotype  of  breast  carcinoma;  within  this  model,  ER-negative  carcinomas  which  do 
not  transcribe  the  ER  gene  define  a  subset  of  tumors  with  a  more  aggressive  phenotype. 
This  theory  is  supported  by  recent  studies  which  have  identified  breast  carcinoma  cell  lines 
that  fail  to  transcribe  an  apparently  normal  ER  gene  ^  ^ .  It  is  therefore  possible  that 
defining  molecular  mechanisms  controlling  transcription  of  the  ER  gene  may  provide  new 
insight  into  the  biology  of  breast  carcinoma. 

Transcription  of  ER  occurs  from  a  number  of  different  promoters,  with  at  least  4 
different  transcripts  identified  to  date  12, 13  pj  is  the  major  ER  transcriptional  start  site 
I"!  and  is  predominantly  utilized  in  human  mammary  epithelial  cells  (HMEC)  and  is  the 
major  start  site  in  ER-positive  human  breast  carcinomas  1^.  Multiple  cap  sites  have  been 
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identified  for  the  PO  promoter.  Studies  of  the  murine  ER  gene  identified  10  cap  sites 
spanning  approximately  60  bases  and  a  start  site  at  -1994  (from  the  PI  cap  site)  was 
identified  in  human  cells  which  would  agree  closely  with  the  major  murine  PO  cap  site 
Transcription  from  the  PO  promoter  is  characteristic  of  human  endometrial  tissue  and  can 
account  for  12  to  33%  of  ER  transcription  in  breast  carcinoma  cells  Recently,  several 
novel  transcription  start  sites  have  been  identified,  including  two,  Pp  and  Ph,  which  splice 
into  +163  of  the  untranslated  leader  sequence  as  does  the  PO  transcript  In  transcript 
PE,  the  sequence  upstream  from  the  +163  splice  site  corresponds  to  -359  to  -169  (relative 
to  the  PI  start  site),  a  region  previously  described  as  part  of  the  intron  between  PO  and  PI. 
The  Ph  cap  site  has  been  shown  to  be  at  least  20  kb  upstream  of  the  PI  promoter, 
suggesting  the  existence  of  an  alternate  promoter  for  ER. 

Purpose  of  Present  Work 

The  experiments  described  herein  address  the  technical  objectives  1,2,3,  and  4  of  the 
U.S.  Army  Research  Grant  DAMD17-94-J-4353.  In  this  study,  we  performed  a  functional 
mapping  of  the  ER  promoter  region  from  3.5  kb  upstream  of  the  PI  start  site  to  the 
translation  start  site  and  found  a  region  in  the  5'  untranslated  leader  sequence  that  controlled 
ER  transcription  in  ER-positive  breast  carcinoma  cell  lines.  This  region  contained  two 
binding  sites  for  a  transcription  factor.  Estrogen  Receptor  Factor- 1  (ERF-1).  ERF-1  was 
identified  by  gel  shift  assay  and  was  present  in  ER-positive  breast  and  endometrial 
carcinoma  cells,  but  absent  in  ER-negative  cell  lines,  thus  inferring  a  role  for  ERF-1  in  the 
regulation  of  ER  transcription  in  breast  carcinoma.  Mutational  analysis  of  the  ERF-1 
binding  site  showed  that  ERF-1  binding  was  correlated  with  transcriptional  activity.  ERF- 
1  was  purified  from  MCF7  breast  carcinoma  cells  using  ion-exchange  and  DNA  affinity 
chromatography  and  the  cDNA  was  isolated  from  a  MCF7  expression  cDNA  library. 
ERF-1  was  identified  as  AP2y,  a  member  of  the  AP2  transcription  factor  family.  When 
binding  specificity  of  AP2y  was  compared  to  AP2a  using  PCR-Assisted  Binding  Site 
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selection  and  competitive  gel  shift  assay,  it  was  found  that  the  two  proteins  recognized  the 
same  DNA  sequence.  Cotransfection  of  AP2a  or  AP2y  with  an  ER  promoter  reporter 
construct  demonstrated  that  activation  of  the  ER  promoter  is  dependent  upon  cell  type. 

MATERIALS  AND  METHODS 
Cell  lines 

HMEC  were  obtained  from  reduction  mammoplasties  and  maintained  in  DFCI-1 
growth  medium  as  described  previously The  ECC-1  ER-positive  human  endometrial 
carcinoma  cell  line  was  obtained  from  P.  G.  Satyaswaroop,  Hershey,  PA.  All  other  cell 
lines  were  obtained  from  American  Type  culture  Collection,  Rockville,  MD.  Cells  were 
maintained  in  minimal  essential  medium  (Gibco  BRL,  Gaithersburg,  MD)  supplemented 
with  10%  fetal  bovine  sera  (Hyclone,  Logan,  UT),  25  mM  4-(2-hydroxyethey)- 
Ipiperazineethanesulfonic  acid  (Hepes),  26  mM  sodium  bicarbonate,  5,000  units/ml 
penicillin  G  (Gibco  BRL),  5,000  ug/ml  streptomycin  (Gibco  BRL),  and  6  ng/ml  bovine 
insulin  (Sigma  Chemical  Company,  St.  Louis,  MO).  Cells  were  incubated  at  37°C  in  5% 
C02. 

Plasmid  construction  and  transfection 

The  region  of  DNA  in  the  5'  end  of  the  ER  gene  was  isolated  from  a  human  genomic 
lambda  library  (Stratagene,  La  Jolla,  CA).  The  5'  end  was  sequenced,  and  a  set  of  primers 
was  constructed,  all  of  which  contained  a  BglU  site  for  use  as  upstream  primers.  All  3' 
oligonucleotides  contained  a  HindBl  site.  These  oligonucleotide  primers  were  used  in  PCR 
with  cloned  DNA  as  the  template.  Primers  used  to  generate  5'  deletion  constructs  were 
ER724  (TACAGATCTGTGGTCCAACATAAACACA),  ER586  (TGCAGATCTTCCTAT 
ATGTATACCC),  ER464(CATTAGATCTGCCCTATCTCGGTTACAGTGT),  ER375(G 
GGGAGATCTAACAGAAAGAGAGACAA),  ER289  (CCCTAGATCTGTCTTTCGCGT 
TTAT),  ER122  (GGGAGATCTGCCTGGAGTGATG  TTTAAG),  ER40  (TATGAGATC 
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TGGAGACCAGTACTTAAAG),  and  ERO  (CCCAGATCTGGCGGAGGGCGTTCG). 
Primers  used  to  generate  specific  3'  ends  were  +230  (CATAAGCTTGGTCCGTGGCCG 
CGGGCAGGGT),  +210  (CGGGAAGCTTGCAGACCGTGTCCCCGCAGG),  +135  (G 
CCCAAGCTTAGAGGCGACGCAGCGCA),  and  +0  (CGCCAAGCTTCCTGGGCTCC 
CGGGCCTC). 

PCR  products  were  then  subcloned  into  the  BglW-Hindlll  sites  of  the  luciferase 
reporter  plasmid  pGL2-Basic  (Promega,  Madison,  Wis.).  To  construct  the  plasmids  with 
3,500  bp  upstream  of  PI,  the  5'  region  was  first  subcloned  into  pBluescript  from  the 
genomic  lambda  clone  as  a  SaR-Not\  fragment,  using  the  SaR  site  in  lambda  DASH  and  the 
Notl  site  in  the  first  exon  of  ER.  The  ER724-xLUC  constructs  were  each  digested  with 
Xho\-Nde\,  removing  the  5'  region  of  the  ER  gene.  This  region  was  then  replaced  with 
the  larger  5'  flanking  region  as  an  Xhol-Ndel  fragment  from  the  ER  gene  cloned  in 
pBluescript.  In  this  construct,  the  Xhol  site  6  bp  upstream  of  the  SaR  site  in  pBluescript 
was  used  and  was  inserted  into  the  Xhol  site  4  bp  upstream  of  the  BgRl  site  in  pGL2- 
Basic.  Each  of  these  plasmids  then  contained  3,500  bp  upstream  of  PI . 

Plasmid  DNA  was  prepared  by  alkaline  lysis  and  purified  by  using  polyethylene 
glycol  as  described  previously  All  cells  were  transfected  by  the  calcium  phosphate 
precipitation  procedure  In  these  experiments,  30  pg  of  cloned  plasmid  DNA  was  used 
in  transfections  of  100-mm-diameter  plates  with  cells  at  50  to  60%  confluence.  In  each 
transfection,  2  pg  of  a  Rous  sarcoma  virus-driven  chloramphenicol  acetyltransferase  (CAT) 
expression  vector  was  cotransfected  and  cells  were  assayed  for  luciferase  and  CAT 
expression  48  h  after  transfection.  Values  presented  are  luciferase  units  as  measured  on  a 
luminometer  corrected  for  transfection  efficiency,  as  determined  by  CAT  assay.  Similar 
results  were  obained  when  15  pg  of  plasmid  DNA  was  used  in  transfections. 
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Preparation  of  Whole  Cell  Extracts 

Cells  were  collected  by  trypsinization.  Cell  pellets  were  washed  with  1  x  PBS  and 
then  resuspended  at  10^  cells/ml  in  microextraction  buffer  [450  ruM  NaCl,  20  mM  HEPES, 
pH  7.7,  25%  glycerol,  0.2  mM  EDTA,  0.5  mM  DTT,  and  proteinase  inhibitors].  Cells 
were  sonicated  and  cellular  debris  was  pelleted  at  14,000  rpm  in  a  microfuge.  Protein 
concentration  of  the  supernatant  was  determined  using  a  Bio-Rad  protein  assay  (Hercules, 
CA)  and  ran  in  the  range  of  5-15  mg/ml.  Extracts  were  stored  at  -80°C  until  use. 

Gel  Shift  Assay 

Gel  shift  assays  were  performed  using  15  ug  nuclear  cell  extract,  purified 
chromatographic  fractions  or  in  vitro  translated  protein.  Protein  was  incubated  in  lx 
binding  buffer  (40  mM  KCl/  20  mM  Hepes,  pH  7.9/  1  mM  MgCl2/  0.08  mM 
ethyleneglycol-bis-(6-aminoethyl)-N,  N,  N',  N’  tetraacetic  acid  (EGTA)/  0.4  mM  DTT), 
4%  Ficoll,  12  ug/ml  poly  dl-dC  (Pharmacia),  1  mg/ml  Bovine  Serum  Albumin  (BSA) 
(Sigma)  and  0.2  ng  of  radiolabelled  double-stranded  oligonucleotide  probe  in  a  volume  of 
25  ul  at  room  temperature  for  1  hour.  Reactions  were  electrophoresed  on  a  4%  acrylamide 
in  0.25X  TBE  at  4°C.  Gels  were  dried  and  autoradiographed  O/N.  In  competitive  binding 
assays,  unlabeled  mutant  oligonucleotides  were  added  at  1,000  fold  excess.  Supershift 
assays  were  performed  by  the  addition  of  2  ug  of  affinity-purified  rabbit  polyclonal 
antibody  directed  against  human  AP2  (Santa  Cruz  Biochemicals)  or  with  2  ul  of  polyclonal 
La/SS-B  antisera  obtained  as  a  gift  from  Michael  Bachmann  (Mainz,  Germany).  Probe 
was  prepared  by  end-labeling  the  wild-type  30-bp  double-stranded  ERF-1  binding  site  with 
[Y-32p]ATP  (3,000  Ci/mmol)  using  T4  polynucleotide  kinase,  followed  by  gel  purification 
on  15%  polyacrylamide.  Mutant  oligonucleotides  used  as  competitors  were  synthesized  to 
contain  specific  point  mutations  in  relation  to  the  wild-type  ERF-1  sequence. 
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Purification  of  ERF-1 

MCF7  nuclei  were  prepared  as  previously  described^  Nuclei  were  resuspended  in 
0.6  times  packed  nuclear  volume  (PNV)  of  buffer  D  (10  mM  Tris,  pH  7.9/100  mM  KCl/2 
mM  MgCl2/0.1  mM  EDTA/1  mM  DTT/  1  mM  NaMetaBis/0.2  mM  phenylmethylsulfonyl 
fluoride)  and  Dounce  homogenized  on  ice  with  10  strokes.  Homogenized  nuclei  were 
mixed  with  0.06X  PNV  of  4  M  ammonium  sulfate  (pH  7.9)  and  extracted  with  gentle 
mixing  at  4°C  for  1  h.  Extracted  nuclei  were  pelleted  by  centrifugation  at  25,000  x  g,  4°C 
for  20  min  and  the  supernatant  was  dialyzed  for  5.5  h  against  5  liters  of  D-lOO  buffer  (20 
mM  Hepes,  pH  7.9/20%  glycerol/  100  mM  KCl/2  mM  MgCl2/0.2  mM  EDTA/1  mM  DTT/ 
1  mM  NaMetaBis/0.2  mM  phenylmethylsulfonyl  fluoride).  After  dialysis,  the  extract  was 
centrifuged  for  20  min  at  16,000  x  g,  4°C.  Protein  concentration  of  nuclear  extract  was 
determined  using  a  Bio-Rad  protein  assay  system  (Bio-Rad)  and  was  generally  in  the  range 
of  7- 15  mg/ml. 

ERF-1  activity  was  monitored  throughout  the  purification  by  gel  shift  assay.  MCF7 
nuclear  extract  (375  mg)  was  applied  to  a  30  ml  Q  Sepharose  Fast  Flow  (Pharmacia)  anion 
exchange  colunm  at  100  mM  KCl  and  eluted  with  1  volume  of  0.15  M  KCl.  This  fraction 
was  applied  to  a  10  ml  Heparin  Sepharose  CL-6B  (Pharmacia)  column,  washed  with  5 
volumes  of  0.35  M  KCl  and  eluted  with  2.5  volumes  of  0.6  M  KCl.  This  0.6  M  fraction 
was  diluted  to  0.1  M  KCl,  applied  to  a  5  ml  DNA  cellulose  (native  DNA;  Pharmacia) 
column,  and  eluted  with  2.5  volumes  of  0.4  M  KCl.  This  fraction  was  diluted  to  0.15  M 
KCl  and  nonbinding  mutant  16  (TGAGCCTTCTGCGGTGCGGGGACACCGTCT)  was 
added  at  10  ug/ml,  incubated  on  ice  10  min  and  centrifuged  at  12,000  x  g  for  10  min  at  4°C 
to  clear  any  precipitated  material.  The  protein  was  divided  in  half  and  applied  to  two  1  ml 
DNA  affinity  columns.  The  columns  were  washed  with  5  volumes  of  0.5  M  KCl  and 
eluted  with  5  volumes  of  0.8  M  KCl.  The  ERF-1  froaction  was  incubated  with  additional 
mutant  oligonucleotide  (10  Mg/ml)  and  passed  a  second  time  over  a  single  1  ml  DNA 
affinity  column  as  describe  above.  All  fractions  were  quick  frozen  in  liquid  nitrogen  and 
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stroed  at  -80°C  for  subsequent  analysis.  The  DNA  affinity  column  was  prepared  by 
attaching  a  double-stranded,  biotinylated  30-mer  oligonucleotide  (TGAGCCTTCTGCCCT 
GCGGGGACACGGTCT)  corresponding  to  the  wild-type  ERF-1  binding  site  to 
streptavidin  agarose. 

Protein  fractions  were  concentrated  by  trichloroacetic  acid  precipitation  as 
described^^.  Protein  pellets  were  resuspended  in  standard  Laemmli  SDS  loading  buffer, 
boiled  3  min,  and  electrophoresed  on  8%  SDS/PAGE  with  standard  protein  molecular 
weight  markers.  Proteins  were  visualized  using  the  Silver  Stain  Plus  kit  (Bio-Rad). 

Protein  Sequencing 

Protein  was  separated  by  SDS/PAGE,  stained  with  Coomassie  brilliant  blue,  and 
individual  protein  bands  excised.  After  in  gel  tryptic  (Promega)  digestion  as  described 
without  the  addition  of  0.02%  Tween.  Candidate  ions  for  sequencing  were  determined  by 
microcapillary  high  performance  liquid  chromatography  coupled  to  a  triple  mass 
spectrometer  (model  TSQ7000  with  electrospray  ionization  source;  Finnigan,  San  Jose, 
CA)  as  described  The  ion  densities  observed  corresponded  to  a  load  of  40-100  fmol  by 
comparison  with  average  ion  abundance  of  a  BSA  standard  protein  digest.  Direct  peptide 
sequence  information  was  obtained  by  collisionally  induced  dissociation  on  an  equivalent 
injection  of  the  digest  mixture.  The  resulting  MS/MS  spectra  were  manually  interpreted. 
The  data  base  searching  algorithm  SEQUEST  was  also  used  to  facilitate  interpretation  of 
MS/MS  spectra. 

UV  crosslinking 

DNA/protein  crosslinking  of  the  ERF-1  complex  was  performed  as  described 
using  15  Mg  of  MCF7  nuclear  extract.  Radiolabeled  probe  was  prepared  as  described  by 
annealing  1  pmol  of  an  oligonucleotide  encompassing  the  ERF-1  binding  site 
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(CCGTGTCCCCGCAGGGCAGAAGGCTCA)  (Operon  Technologies)  with  100  pmol  of 
a  complimentary  oligonucleotide  (TGAGCCTTCT)  (Ana-Gen  Technologies). 

Renaturation  of  ERF-1  Activity  from  SDS/PAGE 

Purified  extract  was  resolved  on  8%  SDS/PAGE  after  which  the  gel  was  segmented 
into  4  mm  slices  ranging  from  3 1  to  220  kDa.  Each  slice  was  crushed  in  0.4  ml  elution 
buffer  (0.15  M  NaCl/0.1%  SDS/0.05  M  Tris-HCl,  pH  7.9/0. 1  mM  EDTA/5  mM  DTT/0.1 
mg/ml  BS  A)  and  protein  was  eluted  for  1  h  at  room  temperamre,  followed  by  precipitation 
with  4  volumes  of  cold  acetone.  The  pellet  was  washed  with  cold  80%  acetone. 
Precipitated  protein  was  dissolved  in  0.5  ml  of  8  M  urea  in  D-lOO  buffer  and  incubated  at 
4°C  for  30  min.  The  protein  was  renamred  by  dialysis  against  1  liter  of  1  M  urea  in  D-lOO 
buffer  for  3  h,  followed  by  two  changes  of  D-lOO  buffer  for  3  h  and  12  h,  respectively. 

Cloning  of  ERF-1  from  an  MCF7  cDNA  Library 

Primers  AACCCTCTGAACCTCCCCTGTCAGAAG  and  CCGGTCTTGGCTGAG 
AAGTTCTGTGAATTC  (Operon  Technologies)  were  used  to  amplify  a  504-bp  fragment 
of  the  ERF-1  mRNA  from  MCF7  using  the  GeneAmp  RNA  PCR  kit  (Perkin-Elmer).  This 
fragment  was  random  primed  labeled  using  [a-^^pj^CTP  for  use  as  a  probe  to  retrieve  a 
full-length  ERF-1  cDNA  clone  from  an  MCF7  expression  library  prepared  using  the  Zap 
Express  cDNA  Synthesis  kit  (Stratagene).  Clones  were  excised  from  X  phage  as 
phagemids  and  were  analyzed  by  restriction  analysis.  ERF-1  cDNA  was  sequenced  using 
an  Applied  Biosystems  automated  sequencer. 

In  Vitro  Transcription  and  Translation 

ERF-1  was  in  vitro  translated  from  ERF-1  cDNA  using  the  TnT  Coupled 
Reticulocyte  Lysate  System  (Promega).  Unlabeled  ERF-1  was  translated  in  vitro  by  using 
a  complete  amino  acid  mix  in  place  of  [35s]methionine.  For  immunoprecipitations,  15  u\ 


12 


PROPRIETARY  DATA 


of  in  vitro  translated  ERF-1  protein  was  incubated  on  ice  for  10  min  with  2  ug  of  affinity- 
purified  rabbit  polyclonal  AP2  antibody  in  a  200  u\  volume  of  RIPA  buffer  (150  mM  NaCl/ 
50  mM  Tris,  pH  8.0/ 1%  Nonidet  40/  0.5%  deoxycholate/  0.1%  SDS).  Fifty  microliters  of 
anti-rabbit  IgG  (whole  molecule)  agarose  (Sigma)  was  added  to  the  reaction  and  incubated 
on  ice  for  15  min.  Immunoprecipitated  protein  was  pelleted  by  centrifugation  at  1,000  x  g 
at  4°C  for  5  min,  washed  once  with  RIPA  buffer  and  twice  each  with  buffer  1  (150  mM 
NaCl/ 10  mM  Tris,  pH  7.5/  2  mM  EDTA/  1%  Nonidet  P-40)  and  buffer  2  (500  mM  NaCl/ 
10  mM  Tris,  pH  7.5/  2  mM  EDTA),  and  the  final  pellet  was  resuspended  in  standard 
Laemmli  SDS  loading  buffer  for  SDS/PAGE  analysis. 

PCR-Assisted  Binding-Site  Selection 

The  technique  of  PCR-Assisted  Binding-Site  selection  was  used  to  determine  the 
target  specificity  of  AP2a  and  ERF-1.  Random-sequence  oligonucleotides  to  be  used  as  a 
pool  of  potential  binding  sites  were  generated  as  described^^.  A  70  bp  oligonucleotide 
containing  a  20  bp  random  core  (APbindSITE:  CATTGATCTATTACAA 
GTCATACAGNNNNNNNNNNNNNNNNNNNNGAATCATGTTATTCAACAGTAA 
GAC)  was  annealed  to  a  25  bp  oligonucleotide  (APbindREV:  GTCTTACTGTTGAATAA 
CATGATTC)  complimentary  to  the  3'  end.  Oligonucleotides  were  labeled  with  [a-^^p] 
dCTP  (800  Ci/mmol;  Amersham)  using  Taq  DNA  polymerase  (Perkin-Elmer)  in  a  single 
cycle  as  follows:  1  min  at  94°C;  3  min  at  62°C;  9  min  at  72°C.  The  extension  reaction  was 
chased  with  additional  cold  dCTP  by  incubating  9  min.  at  72°C.  The  double-stranded 
probe  was  purified  on  8%  nondenaturing  PAGE,  excised  from  the  gel  and  eluted  in  elution 
buffer  (0.5  M  ammonium  acetate/ 1  mM  EDTA/  0.1%  SDS)  for  2  hours  at  48°C.  Eluate 
was  removed  to  a  fresh  tube  containing  20  ug  glycogen  and  DNA  was  recovered  by  ethanol 
precipitation.  The  DNA  pellet  was  resuspended  in  10  ul  dH20  and  0.5  ul  was  counted  in  a 
scintillation  counter  to  calculate  recovery. 
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Twenty  nanograms  of  GST  protein,  ERF-l-GST  fusion  protein,  or  affinity-purified 
AP2a  (Promega)  were  incubated  with  0.4  ng  of  labeled  oligonucleotide  pool  and  1  pg 
AP2a  antibody  (184X;  Santa  Cruz  Biochem,  Santa  Cruz,  CA)  in  lx  binding  buffer  (40 
mM  KCl/  20  mM  Hepes,  pH  7.9,/l  mM  MgCl2/  0.08  mM  EGTA/  0.4  mM  DTT/  4% 
Ficoll/  12  ug/ml  poly  dl-dC  (Pharmacia)/  1  mg/ml  BSA  (Sigma)).  After  the  first  round, 
binding  reactions  contained  only  0.2  ng  of  labeled  oligonucleotide  pool.  Reactions  were 
incubated  at  RT  for  30  min  to  allow  protein-DNA  complexes  to  form.  Complexes  were 
immunoprecipitated  with  protein  A  agarose  (Gibco  BRL)  at  4°C  overnight.  Unbound 
oligonucleotides  were  removed  by  gently  washing  3  times  with  lx  GSB  (40  mM  KCl/  20 
mM  Hepes,  pH  7.9,/l  mM  MgCl2/  0.08  mM  EGTA/  0.4  mM  DTT).  Bound 
oligonucleotides  were  incubated  in  200  ul  recovery  buffer  (50  mM  Tris-Cl,  pH  8, 100  mM 
sodium  acetate,  5  mM  EDTA,  0.5%  SDS)  for  1  hr  at  45°C,  then  extracted  once  each  with 
phenol  and  chloroform.  Specific  binding  site  oligonucleotides  were  recovered  by  ethanol 
precipitation  in  the  presence  of  20  ug  glycogen  and  1/5  volume  of  3  M  sodium  acetate.  The 
pellet  was  resuspended  in  5  ul  of  dH20  and  0.5  ul  was  counted  to  calculate  recovery. 

One  picogram  of  selected  DNA  was  amplified  by  PCR  in  a  reaction  with  IX  PCR 
buffer  (50  mM  potassium  chloride/  10  mM  Tris-HCl,  pH  8.3),  1  mM  MgCl,  0.08  mM 
dGTP,  0.08  mM  dATP,  0.08  mM  dTTP,  4  pM  dCTP,  0.01  mCi  [a  -32p]  dCTP  (800 
Ci/mmol),  160  ng  each  APbindFWD  primer  (CATTGATCTATTACAAGTCATACAG) 
and  APbindRev  primer,  and  2.5  units  Taq  DNA  polymerase  (Perkin  Elmer,  Branchburg). 
Reactions  were  subjected  to  18  PCR  cycles  as  follows;  94°C,  1  min.;  50°C,  1  min.;  72°C,  1 
min.  DNA  was  gel  purified  and  recovered  as  described  above.  Amplified  DNA  (0.2  ng) 
was  used  as  input  DNA  for  the  next  round  of  DNA-protein  binding.  This  cycle  of  DNA 
binding  and  amplification  was  performed  a  total  of  4  times. 

To  further  refine  the  AP2  binding  site,  a  second  series  of  binding  reactions  were 
performed  using  a  57-mer  oligonucleotide  (CATTGATCTATTACAAGTCATACASCCNN 
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NGGSAATCATGTTATTCAACAGTAAGAC;  S=G/C)  as  described,  except  only  two 
binding/amplification  cycles  were  performed. 

Selected  binding  site  oligonucleotides  obtained  after  4  binding/amplification  cycles 
were  ligated  into  pCRT^qi  using  the  Original  TA  Cloning  kit  (Invitrogen  Corp.,  San  Diego, 
CA).  Transformants  were  selected  and  plasmid  DNA  was  prepared  using  the  Wizard  Plus 
Miniprep  DNA  Purification  System  (Promega,  Madison,  WI).  Plasmid  DNA  was 
sequenced  using  the  T7  Sequenase  v2.0  DNA  Sequencing  Kit  (Amersham). 

To  generate  ERF-1  protein  for  PCR-assisted  binding  site  selection,  the  ERF-1  cDNA 
was  cloned  into  PGEX-4T-1  (Pharmacia  Biotech,  Piscataway,  NJ)  to  create  a  fusion 
protein  with  glutathione  S-transferase  (GST).  Using  the  ERF-1  clone  retrieved  from  the 
MCF7  expression  library  as  template,  ERF-1  cDNA  from  the  translation  start  site  at  +167 
to  the  stop  site  at  +15 19  was  PCR  amplified  using  a  5’  primer  (GCCGCTCGAGATGTTG 
TGGAAAATAACCGA)  and  a  3'  primer  (TCGTTCTCGAGTTATTTCCTGTGTTCTCC), 
each  of  which  contained  an  Xhol  site.  After  digestion  with  Xhol,  this  fragment  was  ligated 
into  the  Xhol  site  of  pGEX-4T-l  to  generate  ERF-l/pGEX-4T-l.  ERF-l-GST  fusion 
protein  was  induced  and  purified  using  Glutathione  Sepharose  4B  according  to  the  protocol 
provided  with  the  GST  Gene  Fusion  System  (Pharmacia  Biotech).  For  use  as  a  negative 
control,  GST  protein  was  induced  from  pGEX-4T-l  and  purified  in  the  same  manner. 

Competitive  Gel  Shift  Assay 

Competitive  gel  shift  assays  were  performed  to  determine  the  relative  binding  affinity 
of  a  series  of  mutations  in  the  optimal  binding  site  for  AP2a  and  ERF-1.  Twenty-seven 
mutant  double-stranded  oligonucleotides  were  synthesized  (Operon  Technologies)  with  all 
possible  point  mutations  in  each  position  of  the  9  bp  optimal  binding  site,  GCCTGAGGG. 
The  mutant  oligonucleotides  were  designed  as  57-mers  to  contain  the  identical  flanking 
sequence  used  throughout  the  PCR-Assisted  Binding-Site  selection  (ie.  MutlA= 
CATTGATCTATTACAAGTCATACAACCTGAGGGAATCATGTTATTCA 
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ACAGTAAGAC).  The  57-mer  containing  the  optimal  binding  site  (OPT:  GCCTGAGG 
G)  was  labeled  as  described  for  PCR-Assisted  Binding-Site  selection  and  0.2  ng  was  used 
as  probe  in  each  reaction.  Competitive  gel  shift  assays  were  performed  as  described  above 
except  that  protein  amount  was  limited  (0.2  ul  AP2a;  0.5  ul  ERF- 1/GST  fusion)  to  give  a 
50%  decrease  in  binding  with  25-75  fold  excess  of  cold  OPT  to  [a  -^^p]  dCTP-labeled 
OPT.  For  each  point  mutation,  competitor  was  added  in  increasing  amounts  from  10-  up  to 
5,000-fold  excess  over  labeled  probe. 

Images  were  obtained  by  standard  autoradiography  and  by  use  of  a  Molecular 
Dynamics  (Sunnyvale,  CA)  phoshorimager.  Quantitative  values  for  binding  were  obtained 
from  the  phosphorimager  using  ImageQuant  software  (Molecular  Dynamics).  The  fold 
excess  of  competitor  at  which  a  50%  decrease  in  binding  occurred  was  determined  using 
linear  regression  analysis.  Values  were  corrected  by  dividing  the  value  at  which  a  50% 
decrease  in  binding  was  obtained  by  the  value  at  which  a  50%  decrease  in  binding  occurred 
when  OPT  was  competed  with  itself.  This  corrected  value  is  referred  to  as  Relative 
Binding  Affinity. 

Construction  of  AP2a/pcDNA3.1(+)  and  ERF-l/pcDNA3.1(+)  Plasmids 

To  construct  AP2a/pcDNA3.1(+),  SP(RSV)AP2  (gift  from  Trevor  Williams,  New 
Haven,  CT)  was  digested  with  Kpnl  and  Xho\  to  release  a  fragment  containing  the  AP2a 
sequence  from  the  translation  start  site  at  +62  through  the  3'  untranslated  region.  This 
fragment  was  subcloned  into  KpriUXhol  digested  pcDNA3.1(+)  mammalian  expression 
vector  (Invitrogen,  San  Diego,  CA). 

ERF-l/pcDNA3.1(+)  was  constructed  by  subcloning  the  Xho\  fragment  of  ERF- 
l/pGEX-4T-l  into  the  Xhdl  site  of  pcDNA3.1(+).  This  Xhdl  fragment  contains  the  ERF- 
1  cDNA  from  the  start  site  at  +167  to  the  stop  site  at  +1519. 
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Construction  of  IL2-GL3/APWT  and  IL2-GL3/Mut3T  Reporter  Plasmids: 

To  determine  the  ability  of  AP2a  and  ERF-1  to  activate  an  optimal  binding  site  versus 
a  mutated  binding  site  in  vivo,  two  luciferase  reporter  constructs  were  made.  Two  59bp 
double-stranded,  5'-phosphorylated  oligonucleotides  with  overhanging  Xhol  sites  were 
synthesized  (Operon  Technologies)  to  contain  either  the  optimal  AP2  binding  site  (OPT: 
TCGAGATTGATCTATTACAAGTCATACAGCCTGAGGGAATCATGTTATTCAACA 
GTAAC)  or  a  mutated  binding  site  (M3T:  TCGAGATTGATCTATTACAAGTCAT 
ACAGCTTGAGGGAATCATGTTATTCAACAGTAAC).  These  oligonucleotides  were 
ligated  into  IL2-GL3  digested  with  Xhol  to  remove  90  bp  containing  3  NFAT  binding 
sites.  IL2-GL3  (gift  from  Gerald  Crabtree,  Stanford,  CA)  is  a  luciferase  reporter  plasmid 
which  contains  the  IL2  minimal  promoter  in  addition  to  the  3  NFAT  sites.  DNA  was 
prepared  as  described  above  and  the  insert  was  sequenced  to  verify  sequence  and  direction 
of  insertion. 

Transfection 

COS  cells  were  plated  at  720,000  cells/60  mm  diameter  culture  dish  the  day  before 
the  assay  to  reach  80%  confluency  at  the  time  of  transfection.  COS  cells  were  transfected 
with  a  total  of  6  pg  DNA  using  LIPOFECTAMINE’*'’^  Reagent  (GibcoBRL,  Gaithersburg, 
MD).  All  transfections  were  performed  in  triplicate.  Cells  were  harvested  24  hours  post¬ 
transfection. 

MDA-MB-231  and  HepG2  cells  were  transfected  using  FuGENE’*’’^  6  Transfection 
Reagent  (Boehringer  Mannheim,  ).  Cells  were  plated  the  day  before  to  reach  80% 
confluency  at  the  time  of  transfection  (MDA-MB-231:  7.2  x  10^  cells/60  mm  diameter 
culture  dish;  HepG2:  1.5  x  10^  cells/60  mm  diameter  culture  dish).  Each  plate  was 
transfected  with  a  total  of  2.9  pg  DNA  and  cells  were  harvested  24  hours  post-transfection. 

Whole  cell  extracts  for  gel  shift  assay  or  Western  blot  were  prepared  as  described 
above.  Cell  extracts  for  luciferase  assay  or  B-Galactosidase  assay  were  prepared  using  the 
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Luciferase  Assay  System  With  Reporter  Lysis  Buffer  (Promega).  Luciferase  assays  were 
performed  with  20  pi  of  cell  extract  and  lOOpl  of  Luciferase  Assay  Buffer.  Luciferase 
activity  was  measured  for  10  sec  in  a  Monolight®  2010  luminometer  (Analytical 
Luminescence  Laboratory,  San  Diego,  CA).  All  plates  were  cotransfected  with  pBgal- 
Control  Vector  (Clontech,  Palo  Alto,  CA)  to  measure  transfection  efficiency.  6- 
Galactosidase  activity  was  assayed  using  the  6-Galactosidase  Assay  Kit  (Stratagene,  La 
Jolla,  CA). 

Western  Blot 

To  demonstrate  AP2a  and  ERF-1  expression  in  transfected  cells.  Western  blot 
analysis  was  performed.  Whole  cell  extract  (7  pg/lane)  was  resolved  on  8%  SDS-PAGE, 
blotted  to  PVDF  membrane  and  reacted  with  AP2a  antibody  (Santa  Cruz  Biotechnology) 
using  the  ECL  Western  Blotting  Analysis  System  (Amersham  Life  Science  Ltd, 
Buckinghamshire,  England)  as  described. 

Data  Analysis 

Luciferase  data  for  AP2a  and  ERF-1  activation  of  IL2-pGL3,  OPT/IL2  and  M3T/IL2 
were  corrected  for  transfection  efficiency  by  dividing  the  relative  luciferase  units  by  6-Gal 
units  for  each  transfection.  Background  activity  of  OPT/IL2  and  M3T/IL2  was  subtracted 
from  the  6-Gal  corrected  luciferase  units  to  yield  Corrected  Luciferase  Units.  Each 
transfection  was  performed  in  tripicate  and  data  were  analyzed  for  significance  using 
ANOVA.  Significance  for  two-group  comparisons  (OPT/IL2  versus  IL2,  M3T/IL2  versus 
IL2,  and  OPT/IL2  versus  M3T/IL2)  was  tested  using  a  Bonferroni  t  test  Data  are 
expressed  as  mean  ±  SD. 
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RESULTS 

Functional  Analysis  of  the  ER  Promoter 

To  elucidate  the  mechanisms  which  regulate  transcription  of  the  ER  gene,  we  have 
performed  a  functional  analysis  of  the  ER  promoter.  A  human  genomic  lambda  library  was 
screened  with  a  probe  from  the  5'  flanking  region  of  the  ER  gene.  A  genomic  clone  was 
obtained  which  contained  3500  bp  upstream  of  the  PI  cap  site  for  ER,  the  entire  first 
coding  exon,  and  approximately  10  kbp  of  the  first  intron.  This  5'  flanking  region 
encompasses  500  bp  upstream  of  the  farthest  ER  cap  site  identified.  Regions  of  this  clone 
were  then  subcloned  into  the  luciferase  reporter  vector  pGL2-Basic.  Figure  1  shows  a 
diagram  of  the  promoter  constructs.  The  major  ER  mRNA  beginning  at  PI  contains  a  230 
base  untranslated  5'  leader  sequence  In  the  first  set  of  constructs,  all  inserts  contained 
210  bp  of  the  untranslated  leader  and  a  nested  set  of  5’  deletions  were  generated  from  3500 
bp  down  to  the  PI  cap  site  at  +1  (Figure  lA.)  In  a  second  set  of  promoter  constructs,  the 
5'  end  remained  at  3500  bp  and  3'  deletions  were  constructed  beginning  at  +230  and 
progressively  deleting  portions  of  the  leader  by  sequentially  bringing  the  luciferase  gene 
closer  to  the  PI  cap  site  (Figure  IB.) 

These  constructs  were  tested  for  luciferase  expression  upon  transfection  into  two 
breast  carcinoma  cell  lines  (Figure  2).  T47D  is  an  ER-positive  breast  carcinoma  line  in 
which  approximately  90%  of  the  ER  mRNA  begins  at  the  PI  cap  site  The  ER-negative 
carcinoma  line,  MDA-MB-231  was  also  used  since  we  have  previously  shown  this  cell  line 
to  lack  transcription  of  the  ER  gene  ^  ^ .  Luciferase  activity  in  these  two  cell  lines 
demonstrated  strikingly  different  results  (Figure  2).  In  ER-positive  T47D,  the  full-length 
construct  gave  excellent  expression.  Progressive  deletion  of  the  5'  end  of  the  gene  failed  to 
significantly  alter  expression,  although  there  was  a  reproducible  decline  of  expression  upon 
deletion  of  the  last  40  bp  of  the  5'  flanking  region  which  contains  a  TATA  element  at  +25 
(Figure  2A.)  The  pattern  of  expression  in  ER-negative  MDA-MB-231  was  qualitatively 
and  quantitatively  different.  The  full-length  construct  expressed  poorly  and  gave  values  an 
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order  of  magnitude  less  than  in  T47D.  This  level  of  expression  was  only  three  times  the 
expression  from  the  negative  control  vector  pGL2-Basic,  which  does  not  contain  a 
promoter.  Progressive  deletion  of  the  upstream  sequences  improved  expression  and  no 
significant  decline  occurred  when  the  TATA  element  upstream  of  the  PI  cap  site  was 
deleted.  Results  from  the  3'  deletion  constructs  are  shown  in  Figure  2B.  Deletions  from 
the  3'  end  of  the  leader  identified  a  sequence  between  +210  and  +135  which  augmented 
expression  of  the  ER  promoter  in  ER-positive  T47D  cells.  Analysis  of  these  constructs  in 
ER-negative  MDA-MB-23 1  was  also  performed  and  the  effect  of  this  region  appeared  to  be 
cell  line  specific.  We,  therefore,  searched  for  binding  proteins  that  interact  with  this  region 
of  the  ER  gene. 

Identification  of  ERF-1  by  Gel  Shift 

An  80  bp  probe  was  prepared  from  sequences  from  +132  to  +211  of  the  ER 
promoter.  This  probe  was  used  in  gel-shift  assays  with  extracts  prepared  from  the  ER- 
positive  breast  carcinoma  cell  lines  MCF7  and  T47D  and  the  ER-negative  MDA-MB-23 1 
cell  line.  A  prominent  shift  band  was  found  only  in  extracts  from  the  ER-positive  cell  lines 
(Figure  3).  This  complex  is  referred  to  as  ERF-1  (Estrogen  Receptor  Factor-1).  These 
results  indicate  that  a  DNA  binding  protein,  ERF-1  is  expressed  in  ER-positive  breast 
carcinoma  cells  and  interacts  with  a  region  of  the  leader  with  transcriptional  activity. 

ERF-1  Expression  in  Breast  and  Endometrial  Carcinoma 

To  determine  which  cell  lines  express  ERF-1,  a  panel  of  human  cell  lines  were 
analyzed  for  ERF-1  expression  using  the  gel  shift  assay.  These  results  are  shown  in 
Figures  4A  and  B.  Abundant  ERF-1  expression  was  found  in  all  ER-positive  breast 
carcinoma  cell  lines  tested  -MCF7,  T47D,  and  BT20.  Low  levels  of  ERF-1  were  detected 
in  normal  human  mammary  epithelial  cells  (HMEC).  HBL-100  is  an  ER-negative  breast 
carcinoma  line  which  appears  to  express  low  amounts  of  ERF-1  comparable  to  HMEC. 
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The  lack  of  expression  of  ER  in  HBL-100  could  be  due  to  any  one  of  a  number  of  reasons, 
e.g.  methylation  of  DNA,  deletion  of  the  ER  gene,  rapid  degradation  of  mRNA. 

Expression  of  ERF-1  was  also  examined  in  other  cell  lines,  including  a  panel  of 
human  endometrial  carcinoma  lines.  The  RL95-2  cell  line  is  an  endometrial 
adenocarcinoma  line  which  is  reportedly  ER-positive  This  cell  line  makes  abundant 
ERF-1  protein  as  seen  in  Figure  4B.  Examination  of  ER  expression  in  this  stock  of  RL95- 
2  has  failed  to  demonstrated  ER  expression  (data  not  shown);  however,  late  passage  of  this 
cell  line  has  been  reported  to  lose  ER  expression  30,  3 1  Therefore,  RL95-2  was  derived 
from  an  ER-positive  carcinoma  and  loss  of  ER  expression  as  the  cells  are  maintained  in 
culture  is  likely  the  result  of  DNA  methylation.  ECC-1  is  another  endometrial  cell  line 
which  is  ER-positive  and  we  have  confirmed  expression  of  ER  mRNA  (data  not  shown). 
The  ECC-1  cell  line  also  expresses  abundant  ERF-1  (Figure  4B).  ERF-1  was  not  readily 
detected  in  HEC  IB  or  HEC  lA,  which  are  both  ER-negative  human  endometrial 
carcinoma  cell  lines  These  results  suggest  that  abundant  expression  of  ERF-1 
represents  a  common  mechanism  for  ER  regulation  in  hormonally  responsive  carcinomas. 

Mapping  the  ERF-1  Binding  Site. 

Gel  shift  competitions  were  used  to  identify  two  ERF-1  binding  sites  within  the  80  bp 
region  (+132  to  +211)  of  the  ER  promoter  shown  to  bind  ERF-1.  Cold  competitor 
prepared  from  the  entire  80  bp  probe  efficiently  competes  for  binding  (Figure  3).  The 
region  from  +132  to  +171  partially  competes  and  +172  to  +211  competes  efficiently. 
Within  the  region  from  +172  to  +211,  the  sequences  from  +182  to  +201  compete 
efficiently  for  binding  (Figure  3).  Neither  +172  to  +191  nor  +192  to  +211  demonstrate 
any  competition.  Therefore,  ERF-1  binds  to  two  sites  in  this  region,  a  distal  (high  affinity) 
and  a  proximal  (low  affinity)  site. 
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Mutations  of  ERF-1  Binding  Sites  affects  Binding 

The  distal  binding  site  was  mapped  more  precisely  utilizing  gel  shift  competition  with 
oligonucleotides  containing  mutations  within  the  sequences  from  +182  to  +201.  These 
results  are  shown  in  Figure  5A.  This  region  contains  the  sequence  CCCTGCGGGG 
which  is  an  imperfect  palindrome.  The  wild-type  sequence  of  this  distal  site  (dwt) 
competes  efficiently.  Mutations  dl  and  d3  disrupt  the  imperfect  palindrome  and  destroy  the 
ability  of  the  oligonucleotide  to  compete.  Mutations  d2  and  d4  do  not  alter  the  10  bp 
imperfect  palindrome  and  retain  the  ability  to  compete  for  binding  although  d4  is  slightly 
less  efficient  than  dwt.  Mutation  d5  changes  the  T  at  +192  to  a  C  and  converts  the  sequence 
to  a  perfect  palindrome  but  partially  weakens  the  ability  to  efficiently  compete.  The 
sequence  between  +132  to  +171  that  demonstrated  weak  competition  shown  in  Figure  3, 
was  found  to  contain  a  second  ERF-1  site  located  at  +130  to  +149  (Figure  5B).  Within 
this  region,  a  related  sequence  can  be  found  and  homologous  mutations  as  used  for  the 
distal  site  have  identical  consequences  for  binding.  Interestingly,  insertion  of  a  G  between 
+140  and  +141  creates  a  site  identical  to  the  distal  site.  This  mutation  improves  the  ability 
of  the  weak  proximal  site  to  compete. 

Mutation  of  ERF-1  Sites  Affects  Expression 

Because  the  mutation  pi  destroys  binding  to  the  proximal  site  and  dl  destroys 
binding  to  the  distal  site,  these  two  mutations  were  built  into  the  expression  vector 
ER3500-230LUC  and  called  ER3500-230pldl.  This  new  vector  is  identical  to  ER3500- 
230LUC  except  for  the  two  mutations  within  the  proximal  and  distal  ERF-1  binding  sites. 
Expression  from  this  construct  is  shown  in  Figure  2B.  Mutation  of  these  ERF-1  sites  has 
an  effect  on  expression  similar  to  deletion  of  the  region  from  +135  to  +210.  These  results 
strongly  suggest  that  ERF-1  is  a  transcription  factor  which  is  expressed  in  ER-positive 
breast  carcinoma  and  which  functions  by  binding  to  two  sites  in  the  untranslated  leader  of 
the  ER  gene. 


22 


PROPRIETARY  DATA 


Mapping  of  the  ERF-1  Distal  Binding  Site 

More  precise  mapping  of  the  distal  site  was  performed  by  gel  shift  competition  using 
oligonucleotides  with  double  point  mutations  within  the  sequences  +178  to  +207.  This 
sequence  contains  the  imperfect  palindrome,  CCCTGCGGGG.  Results  are  seen  in  Figure 
6.  Mutants  7,8,9,  and  10  disrupt  the  imperfect  palindrome  and  fail  to  compete  for  binding 
to  the  wild-type  probe.  Mutations  that  target  several  nucleotides  flanking  the  palindrome 
demonstrate  partial  binding  specificity.  The  30  bp  mutants  1,2,13,14,  and  15  demonstrate 
competition  equal  to  wild-type  probe.  These  results  confirm  that  the  ERF-1  binding  site  is 
represented  by  the  imperfect  palindrome  with  some  sequence  specificity  demonstrated  by 
flanking  nucleotides. 

Purification  of  ERF-1. 

ERF-1  protein  was  purified  from  MCF7  nuclear  extract  using  ion  exchange  and  DNA 
affinity  chromatography.  The  30  bp  double  stranded  DNA  probe  comprising  sequences 
from  +178  to  +207  was  used  for  the  DNA  affinity  step.  A  diagram  of  the  purification 
protocol  is  shown  in  Figure  7.  ERF-1  activity  was  followed  through  the  purification 
process  using  gelshift  analysis. 

Molecular  Weight  Determination  of  ERF-1 

Both  UV  crosslinking  and  gel  renaturation  experiments  indicated  the  size  of  ERF-1  as 
«  50  kDa.  First,  the  bound  ERF-1  complex  resolved  by  gel  shift  was  UV  crosslinked  to  a 
radiolabeled  ERF-1  DNA  probe.  This  protein-DNA  complex  had  an  apparent  molecular 
weight  of  60-65  kDa  on  SDS/PAGE  as  shown  in  Figure  8A.  Subtracting  the  moleeular 
weight  of  the  DNA  would  indicate  a  protein  of  ~  50-55  kDa.  Second,  ERF-1  activity  was 
renatured  from  SDS/PAGE.  Purified  extract  was  electrophoresed  on  8%  SDS/PAGE  as 
shown  in  Figure  8B.  This  purified  extract  contained  two  major  bands,  as  well  as  several 
minor  species.  As  can  be  seen  in  Figure  8C,  fraction  number  7  contains  ERF-1  activity. 
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This  renatured  complex  comigrates  with  native  ERF-1  complex  It  has  also  been  shown  that 
this  renatured  complex  demonstrates  identical  binding  specificity  as  determined  for  the 
native  protein  as  shown  in  Figure  6  (data  not  shown).  All  other  fractions  are  negative  for 
binding  activity.  Fraction  number  7  corresponds  to  a  region  of  the  gel  of  «  50  kDa,  which 
agrees  with  the  size  determined  by  crosslinking  studies. 

Fraction  number  7  of  the  purified  extract  contained  two  obvious  protein  bands.  Each 
of  these  bands  was  excised  and  subjected  to  peptide  sequencing.  The  lower,  more 
prominent  band  was  identified  as  La/SS-B,  which  is  an  RNA  binding  protein  La/SS-B 
was  unlikely  to  be  ERF-1  based  upon  known  characteristics  of  this  protein.  In  addition, 
three  separate  antibodies  to  La/SS-B  failed  to  supershift  native  ERF-1  complex  (data  not 
shown).  Sequence  analysis  of  the  upper,  less  prominent  band  yielded  two  peptide 
sequences,  NPLNLPCQK  and  EFTELLSQDR.  Both  of  these  peptides  matched  precisely 
the  amino  acid  sequence  of  a  partial  cDNA  clone  for  AP2y  (GenBank  accession  no. 
X95693).  AP2y  was  identified  based  upon  homology  to  the  AP2a  transcription  factor 

Isolation  of  ERF-1  cDNA 

Two  DNA  oligonucleotide  primers  were  prepared  corresponding  to  the  two  peptide 
sequences  obtained  from  peptide  sequence  analysis.  These  primers  were  used  to  amplify 
MCF7  mRNA  utilizing  RT-PCR,  which  generated  an  expected  PCR  product  of  504  bp 
(data  not  shown).  The  PCR  product  was  used  as  a  probe  to  screen  an  MCF7  cDNA 
library.  Thirteen  separate  cDNA  clones  were  obtained,  with  the  nine  largest  clones  having 
an  identical  pattern  on  restriction  analysis.  The  size  of  these  cDNA  inserts  was  2.7  kb.  One 
of  these  cDNAs  was  sequenced  completely  and  this  sequence  is  shown  in  Figure  9 A.  This 
cDNA  contains  a  long  open  reading  frame  of  1353  bp  encoding  a  protein  with  a  predicted 
molecular  weight  of  48  kDa.  The  predicted  amino  acid  sequence  of  the  ERF-1  protein 
matched  precisely  the  amino  acid  sequence  reported  for  AP2y.  The  ERF-1  cDNA  contains 
a  166-bp  5'  untranslated  leader  and  a  1268-bp  3'  untranslated  region.  Alignment  of  the 
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ERF-1  protein  with  AP2a  is  shown  in  Figure  9B.  Overall,  there  is  a  65%  identity  and 
83%  similarity  between  these  two  proteins.  Homology  is  most  striking  within  the 
carboxyl-terminal  half  of  the  proteins  which  contains  the  DNA  binding  and  dimerization 
domains.  Within  the  carboxyl-terminal  half  there  is  76%  identity  and  90%  similarity 
between  these  two  proteins. 

Confirmation  of  ERF-1  Identity.  The  identity  of  ERF-1  was  confirmed  by  analysis 
of  in  vitro  synthesized  product  and  by  antibody  reactivity.  The  ERF-1  cDNA  was 
subjected  to  in  vitro  transcription/translation.  As  can  be  seen  in  Figure  lOA,  the  in  vitro 
product  has  a  molecular  weight  of  =50  kDa.  A  polyclonal  antisera  generated  against  the 
carboxyl-terminal  20  amino  acids  of  AP2a  is  able  to  immunoprecipitate  the  in  vitro 
synthesized  product.  The  T7  promoter  is  directed  in  the  opposite  orientation  and,  as 
expected,  no  product  is  obtained.  In  vitro  synthesized  ERF-1  was  prepared  in  the  absence 
of  35S-labeled  amino  acids.  This  cold  protein  was  analyzed  in  a  gel  shift  assay  as  shown  in 
Figure  lOB.  In  vitro  synthesized  ERF-1  generates  an  ERF-1  complex  that  comigrates  with 
the  native  ERF-1  complex.  This  product  also  has  the  same  DNA  sequence  specificity  as 
demonstrated  with  competition  using  mutant  oligonucleotide  sequences.  In  addition,  AP2a 
antisera  supershifts  the  in  vitro  ERF-1  complex.  As  expected,  the  extract  prepared  using 
the  T7  polymerase  does  not  generate  the  ERF-1  complex.  The  AP2a  antisera  should  also 
identify  the  native  ERF-1  complex.  As  can  be  seen  in  Figure  11,  this  antisera  supershifts 
all  of  the  ERF-1  complex  from  MCF7  cells  This  supershifted  complex  also  demonstrates 
the  same  pattern  of  sequence  binding  as  determined  by  mutant  competitors. 

It  is  evident  that  the  ERF-1  binding  sequence  is  similar  to  the  sequence  specificity  of 
AP2a  (GCCNNNGGC)  and  AP2a  antisera  cross  reacts  with  ERF-1  protein.  Given 
the  extensive  homology  between  AP2  family  members,  it  might  be  questioned  whether  the 
ERF-1  complex  is  formed  by  AP2a  homodimers.  However,  several  considerations  make 
this  possibility  unlikely.  First,  HeLa  cells  express  AP2a  whereas  ERF-1  activity  was  not 


25 


PROPRIETARY  DATA 


readily  demonstrated  in  this  cell  line  and  several  breast  carcinoma  cell  lines  known  to 
express  ERF-1  do  not  express  AP2a  Second,  although  cloned  AP2a  protein 

binds  to  the  wild-type  ERF-1  probe  by  gelshift  analysis,  the  complex  formed  demonstrates 
different  binding  specificities  for  mutant  competitors  and  has  slightly  faster  mobility  as 
compared  with  native  ERF-1  complex.  Finally,  the  peptides  obtained  from  sequencing 
purified  ERF-1  differ  from  the  amino  acid  sequence  for  AP2a.  These  findings  indicate  that 
ERF-1,  previously  shown  to  be  involved  in  regulation  of  the  ER  gene  promoter,  is  an  AP2 
family  member  distinct  from  AP2a. 

PCR-Assisted  Binding  Site  selection 

PCR- Assisted  binding  site  selection  identified  the  preferred  binding  site  for  AP2a  as 
GCCC/g/t^/C/t^/A/c^/AGG/c  (Table  1)  and  that  for  ERF-1  as 
GCCG/T/g^/C^/A/cGGG/c  (Table  2).  To  provide  a  starting  point,  binding  site  clones 
were  searched  for  sequences  beginning  with  GCC  as  described  previously  for  the  AP2a 
consensus  site  (GCCNNNGGC)  Of  132  selected  clones  sequenced  for  AP2a,  108 
clones  contained  one  or  more  binding  sites  beginning  with  GCC,  5  clones  contained  no 
binding  sequence  fitting  this  criteria,  and  19  clones  contained  unreadable  sequence.  For 
129  clones  sequenced  for  ERF-1,  92  clones  contained  one  or  more  binding  sites,  8  clones 
contained  no  binding  site,  and  29  clones  had  unreadable  sequence. 

Many  clones  contained  multiple  binding  sites,  either  on  the  same  and/or  on  the 
complementary  strand,  making  it  impossible  to  know  which  sequence  was  primarily 
responsible  for  selection  of  the  clone.  Because  this  may  have  biased  the  results,  the 
sequences  were  also  analyzed  to  identify  clones  in  which  only  a  single  binding  site 
appeared.  This  analysis  created  some  changes  in  preferred  binding  site  sequence,  most 
notably  at  positions  5  and  6  for  AP2a  and  at  position  7  for  ERF-1  (Tables  3  and  4 ). 

To  further  refine  the  binding  site  for  AP2a  and  ERF-1,  PCR-Assisted  Binding  Site 
selection  was  repeated  using  a  probe  containing  the  binding  site  SCCNNNGGS  (S=G/C). 
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This  selection  cycle  defined  the  preferred  AP2a  binding  site  to  be  GCCQg/tGNGG^/C 
(Table  5)  and  the  preferred  ERF-1  binding  site  to  be  GCCT/C/qG^/G/cGG^/C  (Table  6). 
Of  36  clones  sequenced  for  each  protein,  readable  sequences  were  obtained  for  27  AP2a 
clones  and  for  25  ERF-1  clones. 

Competitive  Gel  Shift  Assay 

Competitive  gel  shift  assays  using  a  series  of  competitors  with  all  possible  single 
point  mutations  in  the  optimal  binding  site  demonstrated  that  there  were  no  significant 
differences  in  binding  site  preference  between  AP2a  and  ERF-1  (Figures  12  and  13). 
Assuming  that  any  competitor  with  a  Relative  Binding  Affinity  of  6  or  lower  bound  protein 
with  equal  affinity  as  OPT  (GCCTGAGGC),  the  preferred  binding  site  for  both  AP2a  and 
ERF-1  was  determined  to  be  G/cCCNN^/C/g^/A^/A^/C/t- 

Transfections  of  COS  cells 

Cotransfection  of  COS  cells  with  AP2a  or  ERF-1  and  luciferase  reporter  constructs 
containing  either  the  optimal  binding  sequence  (OPT/IL-2)  or  a  mutated  binding  sequence 
(M3T/IL-2)  demonstrated  that  both  proteins  activated  OPT/IL-2  to  a  significantly  higher 
level  than  M3T/IL-2  over  a  range  of  protein  concentrations  (Figures  14C  and  15C).  For 
AP2a,  activation  of  OPT/IL-2  was  significantly  higher  than  M3T/IL-2  when  transfections 
were  performed  with  0.1,  0.5,  1,  or  2  pg  protein.  There  was  no  significant  differences  in 
activation  at  0,  0.2,  or  3  pg.  For  ERF-1,  activation  of  OPT/IL-2  was  significantly  higher 
than  M3T/IL-2  at  all  protein  concentrations  at  or  above  0.2  pg.  For  AP2a,  the  levels  of 
activation  of  both  OPT/IL-2  and  M3T/IL-2  were  significantly  higher  than  activation  of  a 
reporter  construct  containing  only  the  IL-2  minimal  promoter  (IL-2)  except  at  1  ug  for 
M3T/IL-2  vj'  IL-2.  For  ERF-1,  activation  of  both  OPT/IL-2  and  M3T/IL-2  were 
significantly  higher  than  IL-2  at  all  protein  concentrations  above  0  pg.  Overall,  activation 
of  both  OPT/IL-2  and  M3T/IL-2  by  ERF-1  was  3.3  fold  higher  than  activation  by  AP2a. 
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Western  blot  analysis  and  gel  shift  assay  show  that  expression  and  DNA  binding  of  AP2a 
and  ERF-1  increases  linearly  as  COS  cells  are  transfected  with  increasing  amounts  of  these 
proteins  (Figure  14A  and  B;  Figure  15A  an  B). 


Transfection  of  HepG2  and  MDA-231  Cells 

Because  ERF-1  was  a  better  activator  of  OPT/IL-2  and  M3T/IL-2  in  COS  cells  than 
AP2a,  activation  of  a  full-length  ER  promoter  construct  (ER3500-230LUC)  by  ERF-1  and 
AP2a  was  tested  in  HepG2  and  MDA-231,  two  cell  lines  lacking  expression  of  these 
proteins.  As  described  above,  the  ER  promoter  contains  a  10  bp  imperfect  palindrome 
(CCCTGCGGGG)  that  acts  as  a  binding  site  for  ERF-1.  To  fit  into  the  9  bp  pattern 
determined  for  optimal  DNA  binding,  this  sequence  can  be  viewed  as  either 
CCCTGCGGG  or  CCTGCGGG,  the  latter  being  equivalent  to  the  Mut  3T  mutation. 
Preliminary  data  showed  that  in  HepG2  cells,  the  activation  of  ER3500-230LUC  by  both 
AP2a  and  ERF-1  is  greater  than  activation  of  PGL2-Basic  which  contains  no  promoter 
(Figure  16A).  However,  in  MDA-MB-231  cells,  only  ERF-1  is  able  to  activate  ER3500- 
230LUC  above  background  levels  (Figure  16B). 

DISCUSSION 

Although  several  mechanisms  are  involved  in  controlling  expression  of  ER  in  breast 
carcinomas,  transcriptional  regulation  is  clearly  responsible  for  the  ER-negative  phenotype 
in  some  cell  lines  These  data  demonstrate  that  an  important  transcriptional 

regulatory  element  of  the  ER  gene  is  found  within  the  5'  untranslated  leader.  This  region 
contains  two  binding  sites  for  a  DNA  binding  protein,  ERF-1,  which  is  abundantly 
expressed  in  ER-positive  breast  and  endometrial  carcinomas.  These  data  indicate  a  role  for 
ERF-1  in  the  transcriptional  regulation  of  ER.  There  are  likely  other  cis-acting  control 
elements  involved  in  the  regulation  of  ER  transcription.  For  example,  the  transcriptional 
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mapping  data  also  implicate  positive  and  negative  regulatory  elements  upstream  of  the  PI 
transcriptional  start  site  and  there  may  be  other  regions  that  have  not  been  examined  in  the 
present  study.  However,  the  identification  of  ERF-1  offers  a  molecular  mechanism  that 
accounts  for  differences  in  ER  expression  found  in  breast  carcinomas. 

The  complexity  of  ER  transcriptional  regulation  is  suggested  by  the  biology  of  ER 
expression.  ER  is  over-expressed  in  many  ER-positive  breast  carcinomas  that  often  make 
over  100  fmol/mg  cytosol  protein  compared  to  4  fmol/mg  cytosol  protein  in  normal 
mammary  cells  These  differences  in  ER  expression  are  reflective  of  mRNA  levels  and 
there  are  striking  differences  in  the  level  of  ER  mRNA  detected  in  different  breast 
carcinoma  lines  It  is  certainly  possible  that  ERF-1  may  be  responsible  for  ER  over¬ 
expression  identified  in  many  ER-positive  carcinomas.  However,  more  detailed 
experiments  need  to  be  performed  to  address  this  possibility  directly. 

During  this  study,  we  have  now  cloned  the  gene  for  ERF-1  and  have  established 
that  ERF-1  is  a  member  of  the  AP2  transcription  factor  family.  Three  members  of  this 
family  have  been  identified,  AP2a  AP26  and  AP2y^^.  The  protein  sequence  for  ERF- 
1  matches  precisely  the  predicted  amino  acid  sequence  for  AP2y.  The  identification  of 
ERF-1  as  a  member  of  a  well-characterized  transcription  factor  family  provides  additional 
proof  supporting  the  role  of  ERF-1  in  ER  gene  regulation.  However,  it  remains  to  be 
determined  if  ERF-1  is  responsible  for  basal  tissue-specific  expression  of  ER  or  if  ERF-1 
induces  the  over-expression  of  ER  that  is  characteristic  of  many  ER-positive  carcinomas. 
Identification  of  the  ERF-1  gene  now  facilitates  the  resolution  of  these  mechanistic  issues. 

The  homology  of  ERF-1  with  AP2a  raises  several  points  of  speculation  concerning 
the  function  and  biology  of  ERF-1.  AP2a  activates  transcription  by  binding  as  a 
homodimer  to  specific  DNA  sequences  It  has  also  been  shown  that  in  vitro  synthesized 
AP2a,  AP2y  and  AP26  can  form  heterodimers  with  one  another  and  can  bind  to  an  AP2a 
consensus  site  It  is  not  known  if  heterodimers  between  AP2  family  members  occur  in 
vivo  or  whether  they  are  physiologically  relevant.  However,  heterodimer  formation 
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between  ERF-1  and  other  members  of  the  AP2  family  raises  an  additional  potential  for 
complex  gene  regulation.  The  formation  of  heterodimers  may  alter  normal  function  of  AP2 
proteins  or,  alternatively,  may  generate  complexes  with  different  specificity.  Germane  to 
gene  regulation  in  breast  carcinomas,  recent  reports  have  implicated  AP2  factors  in  the 
transcriptional  control  of  c-erbB-2  E-cadherin  and  HSP27  It  is  certainly 
plausible  to  hypothesize  a  role  for  ERF-1  in  the  coordinate  regulation  of  a  set  of  genes  in 
hormonally  responsive  carcinomas.  This  conjecture  is  supported  by  the  fact  that  breast 
carcinoma  cell  lines  that  express  E-cadherin  and  HSP27  are  also  ERF-1  positive,  whereas 
cell  lines  with  low  level  expression  of  these  genes  are  ERF-1  negative 
Identifying  other  genes  regulated  by  ERF-1  will  be  an  important  area  for  futher  study. 

Using  PCR-Assisted  Binding  Site  selection  in  conjunction  with  competitive  gel 
shift  assay,  this  study  also  demonstrates  that  AP2a  and  ERF-1  have  identical  DNA  binding 
specificities.  Previously,  as  determined  by  methylation  interference  assay  and  missing 
contact  probing  assay,  the  consensus  binding  site  for  AP2a  was  determined  to  be 
GCCN3GGC  An  earlier  study  had  derived  an  slightly  different  AP2a  consensus  site 
using  DNA  footprinting  studies  of  four  genes  (hGH,  R-2KP,  hc-myc,  and  BPV)  to  have  a 
T  in  the  first  position  of  a  10  bp  site  ("^/cCQgCC^/CNG'/cC/qG/^^)  42  However,  this 
consensus  site  was  derived  from  a  very  small  sample  of  naturally  occurring  AP2a  binding 
sites  and  no  experiments  were  performed  to  determine  the  affinity  of  the  site  or  the  relative 
importance  of  each  nucleotide.  Although  it  has  not  been  formally  tested,  it  was  assumed 
that  AP2a,  AP2y  and  AP2B  all  had  the  same  binding  specificity  due  to  the  fact  that  they  can 
bind  to  a  high  affinity  AP2a  consensus  site  in  a  gel  shift  assay  and  are  able  to  activate  a 
reporter  construct  containing  three  tandem  copies  of  this  site  However,  in  the  same 
study  it  was  also  demonstrated  that  AP2B  activated  a  reporter  construct  containing  the  AP2 
binding  site  from  the  c-erbB-2  3-fold  less  than  did  AP2a  and  AP2y,  indicating  that  there 
may  be  some  subtle  differences  in  binding  specificity  between  these  proteins.  To  our 
knowledge,  a  systematic  study  in  which  each  position  of  the  DNA  binding  site  was 
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thoroughly  mutated  had  not  been  previously  undertaken.  Although  for  the  most  part  this 
study  eonfirmed  that  the  optimal  binding  site  for  AP2a  and  ERF-1  fits  within  the 
previously  described  consensus  site,  we  were  surprised  to  find  that  a  mutated  binding  site 
with  a  T  in  the  position  9  had  the  same  relative  binding  affinity  as  a  site  containing  either  a 
G  or  C.  In  addition,  due  to  the  palindromic  nature  of  the  consensus  site,  it  is  surprising 
that  a  mutated  binding  site  with  an  A  in  position  1  destroyed  its  ability  to  bind  both  AP2a 
or  ERF- 1. 

If  there  are  no  differences  in  binding  specificity  for  AP2a  and  ERF-1,  then  any 
potential  functional  differences  exhibited  by  these  transcription  factors  must  be  due  to  other 
factors  such  as  spatially  or  temporally  restricted  expression  patterns  or  differences  in  the 
cohort  of  cofactors  required  for  activation.  In  support  of  this,  it  was  shown  in  mouse 
embryos  that  there  are  distinct  patterns  of  AP2a  and  AP26  expression  in  the  developing 
brain  and  face  Additionally,  Western  and  Northern  blot  analysis  of  a  panel  of  human 
mammary  carcinoma  cell  lines  also  indicated  overlapping  but  distinct  expression  patterns 
for  AP2a,  AP26,  and  AP2y,  with  a  4-5-fold  amplification  of  the  AP2y  gene  in  two  cell 
lines,  BT474  and  SKBR3  Several  studies  suggest  that  competition  from  adjacent 
DNA-binding  proteins  is  able  to  displace  AP2a  from  its  binding  site  44-46  However  the 
analagous  experiments  have  not  been  done  for  AP2y  and  to  date,  no  interacting  cofactors 
for  either  AP2a  or  AP2Yhave  been  identified. 

The  validity  of  selecting  an  optimal  binding  site  using  PCR  selection  and 
competitive  gel  shift  assay  was  examined  in  vivo  by  testing  the  activation  of  reporter 
constructs  containing  either  the  optimal  binding  site  or  a  mutated  site  with  a  T  in  position  3 
(M3T).  This  mutation  was  determined  to  be  of  lower  affinity  by  competitive  gel  shift 
assay.  This  experiment  shows  that  a  reporter  construct  containing  the  M3T  mutated  site  is 
activated  2.2-fold  less  than  the  construct  containing  the  optimal  binding  site  at  certain 
amounts  of  imput  AP2a  (Figure  14C)  and  1.4-fold  less  with  ERF-1  (Figure  15C).  For 
AP2a,  the  difference  in  activation  of  the  optimal  and  mutated  sites  is  not  significantly 
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different  with  3  ug  of  imput  AP2a.  It  is  possible  that  when  AP2a  is  overexpressed  to 
certain  levels,  the  distinction  between  low  and  high  affinity  binding  sites  is  negated.  AP2a 
and  ERF-1  expression  in  normal  breast  epithelium  is  very  low  while  in  certain 
mammary  carcinoma  cell  lines  the  expression  of  one  or  both  of  these  proteins  is  greatly 
overexpressed  Therefore,  it  is  possible  that  overexpression  of  AP2a  or  ERF-1  in 

breast  cancer  may  activate  a  set  of  genes  containing  lower  affinity  binding  sites  that  would 
normally  not  be  activated  in  normal  breast  epithileum. 

Preliminary  experiments  in  HepG2  and  MDA-MB-231  cells,  two  cell  lines  lacking 
AP2  activity,  show  that  the  ability  of  AP2a  or  ERF-1  to  activate  a  reporter  construct 
containing  the  ER  promoter  is  dependent  on  cell  type  (Figure  16A  and  B).  In  HepG2  cells, 
a  hepatic  cell  line,  both  AP2a  and  ERF-1  are  able  to  activate  the  ER  promoter  construct 
above  background  levels  with  AP2a  demonstrating  a  2.5-fold  greater  activation  than  ERF- 
1.  However,  in  MDA-MB-231  cells,  an  ER-negative  breast  carcinoma  cell  line,  only  ERF- 
1  is  able  to  activate  the  ER  promoter  construct.  It  is  possible  that  there  are  breast  cell- 
specific  factors  present  in  MDA-MB-231  cells  that  are  absent  in  a  hepatic  cell  such  as 
HepG2  which  repress  AP2a  activation  while  allowing  ERF-1  activation  of  the  ER 
promoter  construct  to  occur. 

CONCLUSIONS 

The  identification  and  cloning  of  ERF-1  offers  new  insight  into  our  understanding  of 
the  relationship  between  ER  expression  and  the  biology  of  breast  and  endometrial 
carcinoma.  ER  expression  defines  a  subset  of  breast  cancer  patients  who,  in  general,  have 
a  better  prognosis  compared  to  patients  with  ER-negative  tumors.  Because  ER  is  a 
transcription  factor,  it  has  been  suggested  that  the  phenotype  displayed  by  ER-positive 
breast  carcinomas  is  due  to  the  repertoire  of  genes  whose  expression  is  regulated  through 
estrogen  response  elements  (ERE).  Alternatively,  ER  expression  might  be  a  marker  for  the 
degree  of  differentiation  of  a  tumor  and  ERF-1  might  be  involved  in  the  regulation  of  a 
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number  of  cellular  genes,  including  ER,  which  are  critical  to  the  differentiated  phenotype. 
The  identification  of  ERF-1  has  immediate  clinical  relevance.  For  example,  tumors  that 
lack  ERF-1  expression  might  define  a  subset  of  cancer  patients  with  a  prognosis  different 
than  patients  with  ER-negative  tumors  where  loss  of  expression  is  due  to  mutations  within 
the  ER  gene.  If  ERF-1  expression  is  a  more  reliable  marker  of  a  clinically  relevant 
phenotype,  this  would  indicate  that  some  ERF-1  responsive  gene,  other  than  ER,  is  critical 
to  the  phenotype  of  ER-positive  carcinomas.  Understanding  the  control  of  ERF-1  may  also 
provide  new  therapeutic  approaches  to  the  treatment  of  aggressive  ER-negative  tumors. 
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FIGURE  LEGENDS 

Figure  1.  ER  promoter  constructs. 

Diagram  of  ER  promoter  constructs  cloned  in  luciferase  expression  vector  pGL2- 
Basic.  Dark  line  represents  DNA  sequence  derived  from  ER  gene.  Arrow  shows  location 
of  PI  transcriptional  start  site.  All  numbers  correspond  to  distance  from  PI  cap  site. 
Location  of  Ndel  site  at  -45  and  Xhdl  site  (in  vector)  are  shown.  ER  coding  region  is 
indicated  in  box.  Luciferase  coding  region  is  designated  by  box  labelled  LUC.  In  plasmid 
names,  first  number  corresponds  to  DNA  length  upstream  of  PI  and  second  number  is 
DNA  length  downstream  of  PI.  A.  5’  deletion  constructs  are  shown.  All  these  constructs 
contain  210  bp  of  the  230  bp  5'  untranslated  leader.  B.  3'  deletion  constructs  are  shown. 
All  the  constructs  contain  3500  bp  upstream  of  PI  and  variable  portions  of  the  untranslated 
leader. 
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Figure  2.  Luciferase  activity  from  ER  promoter  constructs. 

Plasmid  DNA  was  transfected  into  T47D  (ER-positive)  or  MDA-MB-23 1  (ER- 
negative)  breast  carcinoma  cells  and  assayed  for  luciferase  activity.  Data  presented  is 
corrected  for  transfection  efficiency.  Data  was  normalized  with  the  activity  obtained  from 
the  largest  plasmid  in  T47D  as  100%.  Standard  error  is  shown  by  error  bars.  (A) 
Representative  results  of  expression  from  5'  deletion  constructs.  Values  shown  are 
average  of  four  separate  transfection  experiments.  (B)  Representative  results  of  expression 
from  3'  deletion  constructs.  Values  shown  are  average  of  eight  separate  transfections. 
Data  for  ER3500-230pldl  is  average  of  two  separate  transfections  performed  in  triplicate 
and  is  representative  of  other  transfections  performed. 
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Figure  3.  Gel  shift  assay  using  80  bp  probe. 

Gel  shift  assay  was  performed  using  whole  cell  extracts  from  cells  as  shown.  The 
probe  is  a  radiolabelled  80  bp  DNA  fragment  from  ER  promoter  sequences  +132  to  +21 1. 
Gel  shift  competition  involved  addition  of  500  fold  molar  excess  cold  competitor  from 
sequences  of  ER  gene  as  shown.  In  (-)  lanes  there  is  no  competitor  added. 
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Figure  4.  ERF-1  expression  in  human  cell  lines. 

ERF-1  expression  determined  by  gel-shift  assay  using  whole  cell  extracts  prepared 
from  cell  lines  shown  and  reacted  with  a  72  bp  probe  encompassing  proximal  and  distal 
ERF-1  sites.  (A)  Human  breast  carcinoma  lines;  ER-positive:  MCF7,  T47D,  BT20  and 
ER-negative:  MDA-MB-231  and  HBL-100.  HMEC  are  normal  human  mammary 
epithelial  cells.  (B)  Other  human  cell  lines;  HeLa  are  cervical  carcinoma  cells;  Daudi  are 
lymphocytes;  RL95-2,  HEC  IB,  HEC  lA,  and  ECC-1  are  human  endometrial  carcinoma 
cell  lines  (see  text  for  details). 
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Figure  5.  Mapping  ERF-1  binding  sites. 

Gel  shift  competitions  were  performed  using  whole  cell  extract  from  cells  shown. 
Probe  used  in  both  A  and  B  is  a  72  bp  DNA  fragment  from  ER  promoter  sequences  +130 
to  +201.  (A)  Competitions  using  mutant  distal  binding  sites.  Sequence  of  wild-type  distal 
site  (dwt)  and  position  of  mutated  sequence  is  shown  for  each  mutant  sequence  dl  -  d5. 
(B)  Proximal  binding  site  defined  using  proximal  mutants  pi  -  p4.  Below  gel  is  shown 
sequences  of  distal  site  (dwt)  and  proximal  site  wild-type  sequence  aligned  to  highlight 
homology.  Sequence  of  proximal  site  wild-type  sequence  (pwt)  and  four  proximal  site 
mutants  pi  -  p4  is  shown. 
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Figure  6.  Mutational  Analysis  of  ERF-1  Binding  Site 

Gel  shift  analysis  using  nuclear  extract  prepared  from  MCF7  cells  (lanes  2-19)  or 
MDA-MB-231  (lane  20)  with  30  bp  wild-type  oligonucleotide  as  probe.  Competition  with 
mutant  oligonucleotides  as  listed. 
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Figure  7.  Purification  of  ERF-1 . 

Protocol  for  purification  of  ERF-1  protein  using  ion  exchange  and  DNA  affinity 
chromatography.  See  text  for  details. 
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Figure  8.  Size  Determination  of  ERF-1 

A:  ERF-1  complex  was  resolved  by  gel-shift  analysis  and  the  gel  was  subjected  to 
UV-crosslinking.  The  bound  complex  was  excised  and  resolved  on  SDS-PAGE  as 
shown.  The  location  of  molecular  weight  markers  (kD)  is  shown  to  the  left  of  the  gel  lane. 
B:  Purified  extract  was  electrophoresed  on  8%  SDS-PAGE  and  sectioned  into  12  gel 
slices.  The  approximate  location  of  gel  slices  1  through  12  are  shown  to  the  right  of  the  gel 
lane.  Molecular  weight  markers  (kD)  are  listed  to  the  left  of  the  lane  labelled  markers.  C: 
Gel  shift  analysis  of  proteins  renatured  from  gel  slices  in  B.  Fraction  number  7 
demonstrated  ERF- 1  activity. 
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Figure  9.  ERF-1  cDNA 

A:  Complete  sequence  of  ERF-1  cDNA  with  predicted  amino  acid  sequence. 
Boxes  indicate  peptide  sequences  obtained  from  protein  sequencing  of  purified  ERF-1.  B: 
Comparison  of  ERF- 1  protein  sequence  with  AP2a  protein  aligned  to  highlight  similarities. 
(*)  indicates  identical  amino  acids;  (.)  indicates  conserved  substitution. 


57 


Figure  9 . 


I  TCOCAOAa  • 

9  ctfQccqjiTOCgroTCCJioTaiccooaigJLQciAoqcccocoeoeoocoooQQCoocoacM*cgccTOOTCACCcffaxc  I7 

1<7  ATO  TTO  TOO  JUUk  kTK  MCC  Okt  AAT  OTC  JULO  TAC  OJLA  OlO  OAC  tOC  OAO  OAT  COC  CAC  OAC  231 
lllI,WKITDITKriBSC*DKBD20 

227  OOO  A0C  AOC  AAT  000  AAT  CCO  COO  OTC  CCC  CAC  CTC  TCC  TCC  OCC  000  CAO  CAC  CTC  TAC  79* 

21Of*aaM»lV»BLOAAOQBLY40 

2t7  AOC  CCC  OCO  CCA  CCC  CTC  TCC  CAC  ACT  OOA  OTC  OCC  OAA  TAT  CAO  CCO  CCA  CCC  TAC  TTT  341 

4i«rA»fLiiTovABTorPTyr<o 

347  CCC  CCT  CCC  TAC  CAO  CAO  CTO  OCC  TAC  TCC  CAO  TCO  OCC  OAC  CCC  TAC  TCO  CAT  CTO  000  404 
OlPPPTQOLAYlOAAOPyABLOiO 

407  OAA  OCO  TAC  OCC  OCC  OCC  ATC  AAC  CCC  CTO  CAC  CAO  CCO  OCO  CCC  ACA  OOC  AOC  CAO  CAO  444 
•  lBAyAAAZMPI.aQPAPTO«QQ100 

447  CAO  OCC  TOO  CCC  OOC  COC  CAO  AOC  CAO  OAO  OOA  OCO  000  CTO  CCC  TCO  CAC  CAC  OOO  COC  534 
lOlOBNPOBQSQCOAOLPBBBaKlOO 

537  CCO  OCC  OOC  CTA  CTO  CCC  CAC  CTC  TCC  OOO  CTO  OAO  OCO  OOC  OCO  OTO  AOC  OCC  COC  AGO  544 
131PAOLLPBZ.COLBAOAVBAItK140 

587  OAT  OCC  TAC  COC  COC  TCC  OAC  CTO  CTO  CTO  CCC  CAC  OCA  CAC  OCC  CTO  OAT  OCC  OCO  OOC  444 
141OAYItlllDLI.Z.PHABAX,SAAO140 

447  CTO  OCC  OAO  AAC  CTO  OOO  CTC  CAC  OAC  ATO  CCT  CAC  CAO  ATO  OAC  OAO  OTO  CAO  AAT  OTC  704 
14tLAIBLaLBOMPafiMDBVfiBY180 

707  OAC  OAC  CAO  CAC  CTO  TTO  CTO  CAC  OAT  CAO  ACA  OTC  ATT  COC  AAA  OOT  CCC  ATT  TCC  ATO  744 
181SOQaiit.LaO0TViaKOPtBM300 

747  ACC  AAO  AAC  CCT  CTO  AAtT CTC  C«  TOT  CAO  AAO  OAO  CTO  OTO  OOO  OCC  OTA  ATO  AAC  CCC  824 
201  T  X  BPABLPCQK  BtVOAYMBP230 

837  ACT  OAO  ore  TTC  TOC  TCA  OTC  CCT  OOA  AOA  TTO  TCO  CTC  CTC  AOC  TCT  ACO  TCT  AAA  TAC  884 
221  TBVPC«YPOBL«t.L«BT«KY340 

887  AAA  OrO  ACA  OTO  OCT  OAA  OTA  CAO  AOO  COA  CTO  TCC  CCA  CCT  OAA  TOC  TTA  AAT  OCC  TCO  >44 
241  XVTVABV()BXLOPPBCLaAB340 

>47  TTA  CTO  OOA  OOT  OTT  CTC  AOA  AOA  OCC  AAA  TCO  AAA  AAT  OOA  OOC  COO  TCC  TTO  COO  OAO  1004 
241  LZ.OOVLKXAKBK«oaxOI.XB3BO 

1007  AAO  TTO  OAC  AAO  ATC  OOO  TTO  AAT  CTT  CCO  OCC  OOO  AOO  COO  AAA  OCC  OCT  CAT  OTO  ACT  1044 
281Kt.DKZOLIiX.PAailXKAAaVT300 

1047  CTC  CTO  ACA  TCC  TTA  OTA  OAA  OOT  OAA  OCT  OTT  CAT  TTO  OCT  AOO  OAC  TTT  OCC  TAT  OTC  1134 
301  ALT  a  AViaBAVBAABI)PATV320 

1137  TOT  OAA  OCC  OAA  TTT  CCT  AOT  AAA  CCA  OTO  OCA  OAA  TAT  TTA  ACC  AOA  CCT  CAT  CTT  OOA  1184 
32ieBABrPfXPVAayATXPBAO340 

1187  OOA  COA  AAT  OAO  ATO  OCA  OCT  AOO  AAO  AAC  ATO  CTA  TTO  OCO  OCC  CAO  CAA  CTO  TOT  AAA  U44 
341OKa8MAAXKMat.tAA0Q]>C  X340 

1347  OAA  TTC  ACA  OAA  CTT  CTC  AOC  CAA  OAC  COT  ACA  CCC  CAT  003  ACC  AOC  AOO  CTC  OCC  CCA  1304 
341  BfTBLLaQPX  TPBOttBLAPliO 

1307  ore  TTO  OAO  ACO  AAC  ATA  CAO  AAC  TOC  TTO  TCT  CAT  TTC  AOC  CTO  ATT  ACC  CAC  OOO  TTT  1344 
38ivLiTazQfletiaapat.zTBor400 

1347  OOC  AOC  CAO  OCC  ATC  TOT  OCC  OCO  OTO  TCT  OCC  CTO  CAO  AAC  TAC  ATC  AAA  OAA  OCC  CTO  1434 
40loaOAI  CAAVaALOItTZKBA£.43D 

1437  ATT  ore  ATA  OAC  AAA  TCC  TAC  ATO  AAC  CCT  OOA  OAC  CAO  AOT  CCA  OCT  OAT  TCT  AAC  AAA  1484 
4311VSDXaTaMPODQaPADaBX440 

1487  ACC  CTO  OAO  AAA  ATO  OAO  AAA  CAC  AOO  AAA  TAA  AArTOQAArOAAaAAAOOrrTAOaAOAOTAOOOAAO  1554 

441TtXKaBKBXK  • 

1713  TOCTAACTTTOOACCTATTATCAaAAOOTOACAAOTACTOOCTCTTTATTCATTAAaCTTTrrrTTTTTaAACCCCATT  1781 
1793  C'PWrCCTTC'rCTaAAAgTOOTOCtATAAOTTTfAQAATCTTTTAAATACATTCCCTOOOCrAArAaArceACACACTTA  1870 
1871  OCCATTQAAATOTCAAATTqATorOCCCTAaATCAACAaATCAACAATACCTTTiTi'rTCAOTOTTAAOOTAATQOlTO  1949 
1950  OrnTTOTOTCCOCTAAATATTTACCTrOAAAAAAAOAAAAOTOrOTATCTAOCrTLTTCAQAOATCAAOTCCTCTOOT  3038 
2039  AOQAOOCAAAQOtTCTATCTOCTrAOCAACTAOTTAATAAOTOOTATCTOACACACTCTAAACCCCOTOTTCAAACOOO  2107 

2187  AACTTTaAAAOrTAACTCrrcAAATOOOAaACTCTrTaAAATOACATOITCCClTAAOOTACTOAAaCTrrAlTTaCAT  3345 

3345  COrOQAOOrTCTOTATTTTAAaAAACAiOfUCOTrOAOTOTACAaATTrrATTTATOCOTAATTTAATOOOOTCTOTAAA  3423 
2424  TACTOOTOCACTTCTTACOACl  PI i  H UAOACATOOOATCCAATTTTAATATTAACTrTTAATOOrOATOOOOTAATCT  3503 

2583  QAAOATTCrATTTAATTOAAACTCTC T WTC AOAAAOCAATAAL t TTOfCTCvmx.TOTTOIJOCTOAACCCTAAOOTO  3440 
3441  AOTOTOCAOTACAOTOrOTOrOOOTOAAATOOAOATTTOOAATTOAACrCTCTOCCTOTAAATOTTCCCCAAATAATTO  3739 
3740  TTOTOrOTATOATACOTOTATAATAAAAOTAl'fL'ITOtlAOAATTOAAAAAAAAAAAAAAAAAAA  3804 


CRP  1  m.WKITDNVKYEEIKEDRHKSSNOTPSVPKLSSK5HLYSP»PPliIITCTAEY(!PPPrPPPPra!IJlYS5SW:PYSHI/: 

AP2  KI.WELTIinrYE-DCEDIMD(?rS!«rr«iI«U;TVGI!SPYTSMP^mPtMroPP-YPPPmOPI-YPOS(!DPYSW™ 

EPF-1  ElYMMNPIJl-OPAPTCSMfi“<P<:-P(!SllEraCLPSHHCEPK:iJ.Pin,S<;LE«C»VSAPBI»YlU<SDI.ELPI»KAI.l» 

Si  S^rSp!lSMP---OHLpMKOEOES<;t^^P----TOE^^^^ . 

ERE-1  lalUlEKMUOTPHOMlEVOirroMKIiUlM^  LPCOEE-LVOAVMNPTEVFCSWRLSI. 

AP2  MLGD-LSIKSLPHAIEEVPHVEDPCINIPWTVIEPGPVSLSICSlKmVSMPINmnJOG^PK^E^E^l 

EEF-l  I^STSEYlCVTVAEVOPPLSPPECU»ELl«PnJU<AltSElira<SU<EH^Erc^ 

Sf  SSskIUtv»evqprlsppecu«isuyx:viju.«5|^^^ 

ERF-1  MDFRYVCEAEFPSRPVAEYLTRPKIlKRNEMA»RKHiaL»ROO«:REITRU^OORTPHCTSRLRPVLETOI^I-™F 

APJ  AROFi:YVCETEFPRRAVAEFUIRSH-SDPflEOVTRRl®ILL»TR(JICKEFTDLL»ODRSPEOTS^PI^PGIOSCI.Tm 


ERF-1  SLITH=FOS0«CRRVSAl»<YIRERlIVlI>ESni)INPGD0SPlU)Stl-ETI.EniEniRR 
RP2  [aiSHGFOSPAWMVm«B*YlTERlR«!>RMYlSSNPNSHTIWN»RSSI*EER^ 


58 


PROPRIETARY  DATA 


Figure  10.  In  vitro  synthesized  ERF-1 

A:  In  vitro  transcription/translation  from  the  ERF-1  cDNA  using  T3  polymerase 
(sense)  (lane  1)  or  T7  polymerase  (antisense)  (lane  3).  In  vitro  synthesized  ERF-1  was 
immunoprecipitated  using  AP2  polyclonal  antisera  (lane  2). 

B:  Gel  shift  analysis  of  in  vitro  synthesized  ERF-1  (lanes  2-13)  with  mutant  competitors 
compared  to  MCF7  nuclear  extract  (lane  1).  In  vitro  ERF-1  complex  supershifted  with 
AP2  antisera  (lane  13). 
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Figure  11.  Native  ERF- 1  complex  supershift 

MCF7  nuclear  extract  analyzed  by  gel  shift  with  AP2  antisera  (lanes  2-14)  with 
competitors  as  shown  compared  to  native  ERF-1  complex  (lane  1). 


61 


Figure  11 . 


AP2  Ab: 


Competitor: 


o  1-  (o  m  « 

llllllllllllll 


I  I  I  I  I  I  I  I  I 


Supershift  ^  HI 


ERF-1  - 


I  I  I  I 


llli  i 


PROPRIETARY  DATA 


Figure  12.  Competitive  gel  shifts  using  AP2a  with  a  panel  of  competitors 

Affinity  purified  AP2a  analyzed  by  gel  shift  with  competitors  as  shown.  OPT 
(GCCTGAGGG)  labeled  with  [a  -32p]  dCTP  was  used  as  a  probe  and  was  competed  with 
increasing  amounts  of  unlabeled  competitor.  Relative  binding  affinity  compared  to  OPT  is 
indicated  next  to  gel  shift  competitions  for  each  mutant  competitor. 
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Figure  13.  Competitive  gel  shifts  using  ERF-1  with  a  panel  of  competitors 

ERF- 1/GST  fusion  analyzed  by  gel  shift  with  competitors  as  shown.  OPT 
(GCCTGAGGG)  labeled  with  [a  -^^P]  dCTP  was  used  as  a  probe  and  was  competed  with 
increasing  amounts  of  each  unlabeled  competitor.  Relative  binding  affinity  compared  to 
OPT  is  indicated  next  to  gel  shift  competitions  for  each  mutant  competitor. 
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Figure  14.  In  vivo  AP2a  activation  of  OPT/IL-2  vs  M3T/IL-2  reporter  constructs 

A.  Western  blot  analysis  of  whole  cell  extracts  from  transfection  of  COS  cells  with 
AP2a  and  AP2  binding  site  reporter  constructs.  IL-2  reporter  construct  contains  only  the 
minimal  IL-2  promoter  and  is  a  measure  of  background  activity  in  the  cell.  OPT/IL-2 
contains  the  optimal  AP2  binding  site  (GCCTGAGGG)  while  M3T/IL-2  contains  a  mutated 
AP2  binding  site  (GCTTGAGGG).  B.  Gel  shift  assay  of  whole  cell  extracts  as  described 
above.  C.  Luciferase  assay  of  cell  extracts  from  transfections  described  above.  Corrected 
Luciferase  units  determined  as  described  in  text. 
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Figure  15.  In  vivo  ERF-1  activation  of  OPT/IL-2  vs  M3T/IL-2  reporter  constructs 

A.  Western  blot  analysis  of  whole  cell  extracts  from  transfection  of  COS  cells  with 
ERF-1  and  AP2  binding  site  reporter  constructs.  IL-2  reporter  construct  contains  only  the 
minimal  IL-2  promoter  and  is  a  measure  of  background  activity  in  the  cell.  OPT/IL-2 
contains  the  optimal  AP2  binding  site  (GCCTGAGGG)  while  M3T/IL-2  contains  a  mutated 
AP2  binding  site  (GCTTGAGGG).  B.  Gel  shift  assay  of  whole  cell  extracts  as  described 
above.  C.  Luciferase  assay  of  cell  extracts  from  transfections  described  above.  Corrected 
Luciferase  units  determined  as  described  in  text. 
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Figure  15. 
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Figure  16.  In  vivo  activation  of  ER3500-230LUC  by  AP2a  and  ERF-1 

A.  Luciferase  assay  of  cell  extracts  from  transfection  of  HepG2  cells  with  AP2a  and 
ERF-1  and  ER3500-230LUC  or  pGL2-BASIC.  B.  Luciferase  assay  of  cell  extracts  from 
transfection  of  MDA-MB-231  cells  with  AP2a  and  ERF-1  and  ER3500-230LUC  or  pGL2- 
BASIC.  Corrected  Luciferase  units  determined  as  described  in  text. 
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Figure  16 
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Table  1.  Summary  of  all  AP2a  binding  sites 


1  Position  1  1 

2 

3 

4 

5 

6 

7 

8 

9 

G 

1  85 

0 

0 

46 

61 

67 

91 

1  37 

79 

A 

0 

0 

0 

22 

1  9 

55 

53 

1  9 

9 

T 

0 

0 

0 

46 

48 

1  9 

1  1 

1  3 

1  9 

C 

0 

1  85 

1  85 

71 

57 

44 

30 

1  6 

78 

G 

100% 

0% 

0% 

25% 

33% 

36% 

49% 

74% 

43% 

A 

0% 

0% 

0% 

12% 

10% 

30% 

29% 

10% 

5% 

T 

0% 

0% 

0% 

25% 

26% 

10% 

6% 

7% 

10% 

C 

0% 

1  00% 

100% 

38% 

31% 

24% 

16% 

9% 

42% 

G 

c 

c 

C/G/T 

G/C/T 

G/A/C 

G/A 

G 

G/C 

Number  of  each  nucleotide  found  in  each  position  listed  in  top  half  of  table.  Percent  of  each  nucleotide  at  each 
position  listed  in  bottom  half. 
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Table  2.  Summary  of  all  ERF-1  binding  sites 


1  Position  1  1 

2 

3 

4 

5 

6 

7 

8 

9 

G 

1  46 

0 

0 

31 

58 

57 

93 

1  1  6 

63 

A 

0 

0 

4 

1  4 

46 

28 

1  2 

1  1 

T 

0 

0 

0 

46 

28 

8 

1  2 

9 

1  6 

C 

0 

146 

146 

65 

46 

35 

1  3 

9 

56 

G 

100% 

0% 

0% 

21% 

40% 

39% 

64% 

80% 

43% 

A 

0% 

0% 

0% 

3% 

10% 

32% 

8% 

8% 

T 

0% 

0% 

0% 

31% 

19% 

5% 

8% 

6% 

11% 

C 

0% 

100% 

100% 

45% 

31% 

24% 

9% 

6% 

38% 

G 

c 

c 

c/t/g 

G/C 

G/a/c  G  1  G  j 

G/C 

Number  of  each  nucleotide  found  in  each  position  listed  in  top  half  of  table.  Percent  of  each  nucleotide  at  each 
position  listed  in  bottom  half. 
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Table  3.  Summary  of  unique  AP2a  binding  sites 


Position 

1 

2 

3 

4 

5 

6 

7 

8 

9 

G 

46 

0 

0 

20 

21 

20 

25 

40 

26 

A 

0 

0 

0 

1 

2 

1  8 

1  7 

4 

2 

T 

0 

0 

0 

1  5 

7 

4 

0 

1 

5 

C 

0 

46 

46 

1  0 

1  6 

4 

4 

1 

1  3 

G 

100% 

0% 

0% 

43% 

46% 

43% 

54% 

87% 

57% 

A 

0% 

0% 

0% 

2% 

4% 

39% 

37% 

9% 

4% 

T 

0% 

0% 

0% 

33% 

15% 

9% 

0% 

2% 

11% 

C 

0% 

100% 

100% 

22% 

35% 

9% 

9% 

2% 

28% 

G 

c 

c 

G/T/C 

G/C 

G/A 

G/A 

G 

G/C 

Only  clones  containing  a  single  binding  site  are  listed. 
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Table  4.  Summary  of  unique  ERF-1  binding  sites 


1  Position  1  1 

2 

3 

4 

5 

6 

7 

8 

9 

G 

48 

0 

0 

1  2 

20 

20 

29 

40 

25 

A 

0 

0 

0 

0 

2 

1  5 

1  5 

6 

5 

T 

0 

0 

0 

1  3 

6 

3 

2 

2 

7 

C 

0 

48 

48 

23 

20 

1  0 

2 

0 

1  1 

G 

100% 

0% 

0% 

25% 

42% 

42% 

60% 

83% 

52% 

A 

0% 

0% 

0% 

0% 

4% 

31% 

31  % 

13% 

10% 

T 

0% 

0% 

0% 

27% 

1  3% 

6% 

4% 

4% 

15% 

C 

0% 

100% 

100% 

48% 

42% 

21% 

4% 

0% 

23% 

G 

c 

c 

C/t/q 

G/C 

G/a/c 

G 

G 

G/c 

Only  clones  containing  a  single  binding  site  are  listed. 


Table  5.  Summary  of  AP2  alpha  clones  from  revised  binding  site  selection 


1  Position  1  1 

2 

3 

4 

5 

6 

7 

8 

9 

G 

1  9 

0 

0 

8 

1  8 

6 

27 

27 

1  1 

A 

0 

0 

0 

1 

1 

9 

0 

0 

0 

T 

0 

0 

0 

8 

2 

6 

0 

0 

0 

C 

8 

27 

27 

1  0 

6 

6 

0 

0 

1  6 

G 

70% 

0% 

0% 

30% 

67% 

22% 

100% 

1  00% 

41% 

A 

0% 

0% 

0% 

4% 

4% 

33% 

0% 

0% 

0% 

T 

0% 

0% 

0% 

30% 

7% 

22% 

0% 

0% 

0% 

C 

30% 

1  00% 

1  00% 

37% 

22% 

22% 

0% 

0% 

59% 

G 

c 

c 

C/T/G 

G 

N 

G 

G 

C/G 

Number  of  each  nucleotide  found  in  each  position  listed  in  top  half  of  table.  Percent  of  each  nucleotide  at  each 
position  listed  in  bottom  half. 
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Table  6.  Summary  of  ERF-1  clones  from  revised  binding  site  selection 


1  Position  1  1 

2 

3 

4 

5 

6 

7 

8 

9 

G 

21 

0 

0 

5 

1  4 

7 

25 

25 

1  3 

A 

0 

0 

0 

2 

4 

1  0 

0 

0 

0 

T 

0 

0 

0 

1  2 

3 

2 

0 

0 

0 

C 

4 

25 

25 

6 

4 

6 

0 

0 

1  2 

G 

84% 

0% 

0% 

20% 

56% 

28% 

100% 

100% 

52% 

A 

0% 

0% 

0% 

8% 

16% 

40% 

0% 

0% 

0% 

T 

0% 

0% 

0% 

48% 

12% 

8% 

0% 

0% 

0% 

C 

16% 

100% 

100% 

24% 

16% 

24% 

0% 

0% 

48% 

G 

c 

c 

T/c/q 

G 

G 

G 

G/C 

Number  of  each  nucleotide  found  in  each  position  listed  in  top  half  of  table.  Percent  of  each  nucleotide  at  each 
position  listed  in  bottom  half. 
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